Stochastic optimization

Results: 750



#Item
121Machine learning / Multi-armed bandit / Stochastic optimization / Algorithm / Mathematics / Academia / Applied mathematics

A Relative Exponential Weighing Algorithm for Adversarial Utility-based Dueling Bandits Pratik Gajane Tanguy Urvoy Fabrice Cl´erot

Add to Reading List

Source URL: jmlr.org

Language: English - Date: 2015-09-16 19:38:44
122Machine learning / Multi-armed bandit / Stochastic optimization / Markov models / Active learning / Dynamic programming / Reinforcement learning / Variance

multi-bandit_techreport.dvi

Add to Reading List

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2011-11-19 09:10:16
123Numerical analysis / Machine learning / Applied mathematics / Online machine learning / Mathematical optimization / Gradient descent / Statistical classification / Computational statistics / Artificial neural networks / Support vector machine / Stochastic gradient descent

WIC Wintermeeting, February 1, 2016 From Data Compression to Online Machine Learning Tim van Erven

Add to Reading List

Source URL: www.timvanerven.nl

Language: English - Date: 2016-02-01 08:21:05
124Mathematics / Dynamic programming / Mathematical analysis / Stochastic control / Discrete mathematics / Algebra / Distribution / Number theory / Markov decision process / Prime-counting function / Mathematical optimization / Projection-valued measure

Finite-Sample Analysis in Reinforcement Learning Mohammad Ghavamzadeh INRIA Lille – Nord Europe, Team SequeL Outline

Add to Reading List

Source URL: mistis.inrialpes.fr

Language: English - Date: 2011-12-05 01:35:36
125Statistics / Machine learning / Multi-armed bandit / Stochastic optimization / Bandit / Variance

Multi-Bandit Best Arm Identification V. Gabillon, M. Ghavamzadeh, A. Lazaric & S. Bubeck Sequel Group Meeting, 21 octobre, 2011. An Example

Add to Reading List

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2011-10-25 11:18:31
126Mathematical analysis / Dynamic programming / Markov processes / Stochastic control / Statistical inference / Analysis / Operations research / Markov decision process / Regret / Distribution / S0 / Mathematical optimization

Optimism in Sequential Decision-Making under Uncertainty Peter Bartlett Department of Statistics and Division of Computer Science UC Berkeley

Add to Reading List

Source URL: www.stat.berkeley.edu

Language: English - Date: 2007-08-23 19:31:01
127Mathematical optimization / Operations research / Mathematical analysis / Numerical analysis / Convex optimization / Stochastic optimization / Multi-armed bandit / Game theory / Linear programming / AMPL

CSStat 260, Fall 2014: Learning in Sequential Decision Problems Lectures: Evans 334. Tuesday/Thursday 2:00-3:30. Instructor: Peter Bartlett http://www.stat.berkeley.edu/∼bartlett

Add to Reading List

Source URL: www.stat.berkeley.edu

Language: English - Date: 2014-08-28 11:49:09
128Markov processes / Mathematics / Probability theory / Mathematical analysis / Dynamic programming / Markov decision process / Stochastic control / Markov chain / Mathematical optimization / Distribution

Stat 260/CSLearning in Sequential Decision Problems. Peter Bartlett 1. Recall: MDPs. 2. Value iteration. 3. Policy iteration.

Add to Reading List

Source URL: www.stat.berkeley.edu

Language: English - Date: 2014-11-25 12:45:38
129Mathematical analysis / Machine learning / Learning / Statistics / Operations research / Statistical classification / Stochastic optimization / Convex optimization / Mathematical optimization / Support vector machine / Distribution / Stochastic gradient descent

Optimizing Non-decomposable Performance Measures: A Tale of Two Classes Harikrishna Narasimhan∗ Indian Institute of Science, Bangalore, INDIA HARIKRISHNA @ CSA . IISC . ERNET. IN

Add to Reading List

Source URL: jmlr.org

Language: English - Date: 2015-09-16 19:38:47
130Markov processes / Dynamic programming / Probability theory / Stochastic control / Mathematical analysis / Probability / Markov models / Markov decision process / Mathematical optimization / Markov chain / Bellman equation / X0

Stat 260/CSLearning in Sequential Decision Problems. Peter Bartlett 1. Markov decision processes and partially observable Markov decision processes. 2. Value functions, Q functions.

Add to Reading List

Source URL: www.stat.berkeley.edu

Language: English - Date: 2014-11-25 12:45:37
UPDATE